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[0001] In a parallel-processing computer system in which a plurality of 

processors are interconnected via a connecting network, the present invention relates to 
scheduling the run of parallel processing in an information processing system that 
executes parallel processes of parallel programs operating on the plurality of processors 
in a time-sharing-based multiplex manner. 
Background of the Invention 

[0002] A parallel-processing computer system is typically formed of a 

plurality of processors that are interconnected by a connecting network. In such a 
system, parallel programs are run on the plurality of processors, and each program is 
divided into multiple parallel processes which are allocated to the separate processors 
and operate in parallel. Data exchange via the connecting network takes place among the 
processes operating in parallel on the plurality of processors and thereby the processes as 
a whole produce results. 

[0003] When the parallel processes that are executed in this way are 

running, they are checked to ensure that they are synchronized at certain-interval 
checkpoints. When one process is running in conjunction with other remote processes 
and a checkpoint is reached, if any remote process does not reach the check point, data 



exchange with the remote process is impossible and what is called a wait for 
synchronization occurs. 

[0004] When a plurality of parallel programs are executed simultaneously, 

the entire computer system is used in a time-sharing manner. This method is explained 
in "Parallel Operating System" by Akira Fukuda, pp. 55-56 (published by Korona-sha, 
1997) (ISBN4-339-0258-9). When a plurality of parallel programs are executed in a 
time-sharing-based multiplex manner in the parallel-processing computer system, it is 
necessary to schedule the run of parallel processes on each processor and switch to the 
next process. 

[0005] If scheduling the run of parallel processes is performed without 

coordinating the process-run steps of the plurality of processors, due to the run time 
difference among the parallel processes executed on the processors, a problem arises in 
that the wait time for synchronizing the parallel processes increases. To avoid such an 
increase in the wait time for synchronization, in scheduling the run of the parallel 
processes of parallel programs, it is necessary to coordinate the process-run steps of the 
processors so that the parallel processes of one of the parallel programs will be executed 
simultaneously and in turn. 

[0006] There are some methods of such scheduling for coordinating the 

processors, one of which is described in (1) "Parallel Operating System" by Akira 
Fukuda, pp. 56-62 (published by Korona-sha, 1997) (ISBN4-339-0258-9). This parallel 
process run scheduling method is outlined as follows. The parallel processes of parallel 
programs are simultaneously executed during a time slice. When the time slice expires, a 
global scheduler is called. The scheduler selects the next process to be executed and 
notifies the processors of that process. 
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[0007] Another parallel process run scheduling method is disclosed in (2) 

JP-A-74 150/1 998. This scheduling method is outlined as follows. To execute the 
parallel processes of parallel programs, a program allocation table is created, wherein the 
start time and the end time of a given period for a process run are predetermined. After 
being notified of the information specified in this table, the processors execute the 
processes according to that information. 
Summary of the Invention 

[0008] In the above-described scheduling methods, the step of notifying 

the processors of necessary information is required, each time switching to the next 
process is performed in the case of the above-mentioned reference (1) and each time a 
program allocation table is created or the table is updated in the case of the above- 
mentioned reference (2). 

[0009] Therefore, in addition to the processing performed by each 

individual processor, management processing is required for sending such notification 
from one processor to other processors. The overhead time due to this management 
processing is a drawback of the conventional scheduling methods of coordinating the 
processing steps of the processors. 

[0010] The object of the present invention is to provide a parallel process 

run scheduling method which does not require explicit processor-to-processor 
communication, thus reducing the overhead due to scheduling by coordinating the 
processing steps of the processors. 

[001 1] To achieve the above object, and in accordance with a first aspect 

of the present invention, a parallel process run scheduling method is provided to be used 
in an information processing system comprising a plurality of processors on which a 
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plurality of parallel programs, each consisting of an equal number of parallel processes, 
are run in a time-sharing-based multiplex manner. 

[0012] In this parallel process run scheduling method, all allocated 

parallel processes on each processor are executed in an order as determined according to 
the value of an integrating counter on each processor that increments over time in 
synchronization with all other processors and the number of the parallel programs to run 
concurrently in the system, thereby simultaneously executing the parallel processes of 
one of the parallel programs in turn on the processors. 

[0013] In accordance with a second aspect of the present invention, a 

parallel process run scheduling method is provided to be used in an information 
processing system comprising a plurality of processors on which a plurality of parallel 
programs, each consisting of the discrete number of parallel processes, are run in a 
time-sharing-based multiplex manner. 

[0014] In this parallel process run scheduling method, all allocated 

parallel processes on each processor are executed in an order as determined according to 
the value of an integrating counter on each processor that increments over time in 
synchronization with all other processors and the number of the parallel programs to run 
concurrently in the system, wherein, if the number of the parallel processes of a parallel 
program is less than the number of processors to be used for running the parallel program 
in the time-sharing-based multiplex manner, some of the processors to which no parallel 
process of the parallel program is allocated execute a dummy process allocated instead 
when the parallel program is executed, thereby simultaneously executing the parallel 
processes of one of the parallel programs in turn on the processors. 

[001 5] In accordance with a third aspect of the present invention, a 
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parallel process run scheduling device is provided to be used in an information 
processing system comprising a plurality of processors on which a plurality of parallel 
programs, each consisting of the equal number of parallel processes, are run in a 
time-sharing-based multiplex manner. This device to be provided on each processor 
comprises an integrating count means, the count value of which increments over time in 
synchronization with all other processors; a process number count means to store the 
number of processes to be run on the processor, corresponding to the number of parallel 
programs to be run in the system; a means for generating a queue of processes to be 
executed; a process queue buffer means for storing the generated process queue; and a 
means for determining a process to be executed out of the processes in the process queue, 
according to the count value of the integrating count means and the number of processes 
retained in the process number count means. 

[0016] In accordance of a fourth aspect of the present invention, a parallel 

process run scheduling device is provided to be used in an information processing system 
comprising a plurality of processors on which a plurality of parallel programs, each 
consisting of the discrete number of parallel processes, are run in a time-sharing-based 
multiplex manner. 

[0017] This device to be provided on each processor comprises an 

integrating count means, the count value of which increments over time in 
synchronization with all other processors; a processes number count means to store the 
number of processes to be run on the processor, corresponding to the number of parallel 
programs to run in the system; a means for generating a queue of processes to be 
executed, wherein, if the number of processes allocated to the processor for executing 
them is less than the number of processes retained in the processes number count means, 
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the means for generating a queue generates a queue of processes including as many 
dummy processes as required to fill up the difference in the number of processes of both, 
whereas if the number of processes of both agrees, the means for generating a queue 
generates a queue of processes not including dummy processes; a process queue buffer 
means for storing the generated process queue; and a means for determining a process to 
be executed out of the processes in the process queue, according to the count value of the 
integrating count means and the number of processes retained in the processes number 
count means. 

[001 8] Other and further objects, features and advantages of the invention 

will appear more fully from the following description. 
Brief Description of the Drawings 

[0019] A preferred form of the present invention illustrated in the 

accompanying drawings in which: 

[0020] FIG. 1 is a diagram illustrating the transition of the processes to be 

executed on each processor over time, according to the parallel process run scheduling 
method of the present invention; 

[0021] FIG. 2 is a block diagram showing an exemplary system of 

parallel-processing computers in accordance with a preferred embodiment of the 
invention; 

[0022] FIG. 3 is a schematic process flow diagram of action applied to 

parallel programs in a preferred embodiment of the invention; 

[0023] FIG. 4 is a diagram illustrating an example of the transition of the 

processes to be executed on each processor over time in the first embodiment of the 
invention; 
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[0024] FIG. 5 is a diagram illustrating an example of the transition of the 

processes to be executed on each processor over time, after the parallel processes of 
parallel programs are generated and allocated in accordance with the scheduling method 
of the first embodiment of the invention; 

[0025] FIG. 6 is a diagram illustrating an example of the transition of the 

processes to be executed on each processor over time in accordance with the scheduling 
method of a second embodiment of the present invention; 

[0026] FIG. 7 is a diagram illustrating an example of the process queue 

when the number of processes is two in the first embodiment of the invention; 

[0027] FIG. 8 is a diagram showing an example of the process queue 

when the number of processes is three in the first embodiment of the invention; 

[0028] FIG. 9 is a diagram showing an example of the process queue 

when the number of processes is two in the second embodiment of the invention; 

[0029] FIG. 10 is a flowchart illustrating the processing of a 

process-executing entity in a preferred embodiment of the invention; 

[0030] FIG. 1 1 is a flowchart illustrating the processing of a process 

queue manager in a preferred embodiment of the invention; 

[003 1] FIG. 12 is a flowchart illustrating the processing of a parallel 

program manger in the first embodiment of the invention; and 

[0032] FIG. 13 is a flowchart illustrating the processing of the parallel 

program manger in the second embodiment of the invention. 
Description of the Preferred Embodiments 

[0033] First of all, the principle embodying the present invention will be 

explained. 
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[0034] In accordance with a preferred embodiment of the invention, a 

parallel-processing computer system comprising a plurality of processors interconnected 
via a connecting network is assumed. In such a system, when a plurality of parallel 
programs, each consisting of an equal number of parallel processes, are executed in a 
time-sharing-based multiplex manner, the following scheduling method is applied to 
execute the processes. 

[0035] The processors execute the allocated parallel processes as 

determined by a function f (c, pn) per time slice for each processor during a time slice. In 
this function, a value of c is given by the integrating counter that each processor has and 
a value of pn is the number of processes to be executed on each processor, that is, 
corresponding to the number of parallel programs running concurrently. The function f 
(c, pn) is uniquely determined, depending on the value of c and the value of pn. The 
function f (c, pn) is used such that when the same value of c and the same value of pn are 
given for all processors, the parallel processes of one of the parallel programs are 
determined to run on the processors. 

[0036] The above-mentioned integrating counters on all processors in the 

system are initialized to a value of the same origin when the system reboots and 
increment over time synchronously for all processors. At any given time, all processors 
are given the equal value of the integrating counter c and the equal number of parallel 
processes pn and the processes determined by f (c, pn) are the parallel processes of one of 
the parallel programs. As a consequence, the parallel processes to be executed on the 
processors are sequenced correspondingly. Thus, the parallel processes of one of the 
parallel programs are executed simultaneously and in turn. 

[0037] In the way described above, the overhead due to scheduling by 
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coordinating the process-run steps of the processors can be reduced and explicit 
processor-to-processor communication can be made unnecessary. 

[0038] When a plurality of parallel programs, each consisting of a discrete 

number of processes, are executed in the time-sharing-based multiplex manner, multiple 
processors are shared for running the programs. When a processor is not executing a 
parallel process of one of the parallel programs, it executes a dummy process 
simultaneously with the parallel processes of the parallel program. In this way, the 
number of processes executed on each processor is made equal for all processors and the 
above parallel process run scheduling method for simultaneously executing the parallel 
processes of one of the parallel programs in turn is applied. 

[0039] In the way described above, even if a plurality of parallel 

programs, each consisting of a discrete number of processes, are executed in a 
time-sharing-based multiplex manner, the overhead due to scheduling by coordinating 
the process-run steps of the processors can be reduced and explicit 
processor-to-processor communication can be made unnecessary. 

[0040] FIG. 1 is a diagram outlining the parallel process run scheduling to 

be carried out by the present invention. In this diagram, two parallel programs, one 
consisting of two parallel processes Al and A2 and the other consisting of two parallel 
processors Bl and B2, are executed on processors PI and P2. The processor PI is 
responsible for executing the processes Al and Bl, whereas the processor P2 is 
responsible for executing the processes A2 and B2. At times Tl to T4, the processors 
execute processes, Al and A2, Bl and B2, Al and A2, and Bl and B2 as determined by f 
(cl, pn 1), f (c2, pn2), f (c3, pn3), and f (c4, pn4), respectively. Hereupon, the values of 
cl to c4 are given by the integrating counter of each processor at times Tl to T4, 
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correspondingly, and the values of pnl to pn4 are the number of processes to be executed 
by each processor at times Tl to T4. These values are equal for the processors PI and 
P2. 

[0041] As represented in FIG. 1, the parallel processes of one of the 

parallel programs are executed simultaneously in turn. A preferred embodiment of the 
present invention will be explained below. 

[0042] FIG. 2 shows an exemplary system of parallel-processing 

computers representing one preferred embodiment of the invention. The 
parallel-processing computers system 1000 of the present invention comprises N 
processors PI to PN 2000 in a group interconnected by an connecting network 3000. 
Each of the processors PI to PN is equipped with an integrating counter 1, a process 
number counter 2, a process queue 3, a process queue manager 4, and a 
process-executing entity 5. In addition, part of or all the processors are equipped with a 
parallel program manager 6. 

[0043] The integrating counter 1 increments over time in synchronization 

with all other processors, and all the integrating counters of all processors are initialized 
to a value of the same origin when the system reboots. As an example of such a counter, 
a time register that each processor has is disclosed in JP-A-28191 1/1995. 

[0044] The process number counter 2 is used to store the number of 

processes to be executed by each processor. This number equals the number of parallel 
programs running concurrently in a time-sharing manner in the system. The value of this 
counter may increase or decrease by the processing of the process queue manager 4. 

[0045] The process queue 3 is used to store the processes to be executed 

by each processor, which will be described later. Examples of the process queue 3 are 
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given in FIGS. 7, 8, and 9. An identifier that is unique within the system is determined 
for one parallel program and is assigned to the parallel processes of the same parallel 
program. The processes placed in the process queue 3 are arranged according to the 
sequence of the identifiers assigned to the processes. Specifically, the parallel processes 
of one of the parallel programs are placed in the process queues 3 of the processors in the 
corresponding position of order. In the present embodiment of the invention, the 
identifiers are assigned by the parallel program manager 6, and an example thereof will 
be described later. 

[0046] FIG. 3 is a schematic representation of the process flow for 

generating, executing, and deleting parallel programs in the embodiment of the present 
invention. 

[0047] A request for activating a parallel program 3 1 from a system user 

is handled by any one of the parallel program managers 6 in the system. The parallel 
program manager 6 issues a request for generating a parallel process 32 of the parallel 
program to each processor. The process queue manager 4 of each processor enqueues the 
parallel process allocated in accordance with the process generation request into the 
process queue 3 and increments the processes number counter 2 (33). The parallel 
processes of the parallel program activated by the user, thus stored in the process queues 
3 of the processors by the process queue managers 4, are executed by the 
process-executing entities 5. For each time slice, each process-executing entity 5 
determines one of the processes stored in the process queue 3, according to the value of 
the integrating counter 1 and the value of the process number counter 2, and executes that 
process during the time slice (34). 

[0048] The process-executing entities 5 of the processors repeat process 



execution in a time-sharing manner until all parallel processes of a parallel program have 
been completed. Upon the completion of all parallel processes (35) of a parallel 
program, the parallel program manager 6 issues a request for removing the parallel 
process of the parallel program to each processor. In accordance with the request for 
removal, the process queue manager 4 of each processor removes the parallel process 
from the process queue 3 and decrements the process number counter (37). 

[0049] The first embodiment of the present invention will be further 

explained with reference to FIGS. 2, 4, 5, 7, 8, 10, 11, and 12. 

[0050] The parallel processes of parallel programs are executed by the 

process-executing entities 5 of the processors in the parallel-processing computer system 
1000. If a time slice expires during the execution of a process, the execution of the 
process is stopped; and, a process is determined as the next to be executed, that next 
process is retrieved from the process queue 3, according to the integrating counter 1 and 
the processes number counter 2, and the next process is executed. 

[005 1] Because each process-executing entity 5 determines a process to 

be executed per time slice and executes the process, a sequence in which the parallel 
processes of the parallel programs are executed is determined by each process-executing 
entity 5 of each processor. 

[0052] FIG. 10 is a flowchart illustrating the processing of the 

process-executing entity 5, which will be explained below. This processing is activated 
by a timer interrupt that the system generates periodically. By way of example, it is 
assumed that timer interrupts take place at intervals of 10 ms and the period of a time 
slice is 1 00 ms. 

[0053] In step 1 0 1 , the process-executing entity judges whether the time 
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slice period has expired when the local processor is executing a process. If the period has 
not expired, the process-executing entity terminates the processing. If the period has 
expired, the process-executing entity goes to step 102. 

[0054] In the step 102, the process-executing entity determines a process 

to execute, retrieves the process from the process queue 3, by referring to the integrating 
counter 1 and the processes number counter 2, then goes to step 103. To give an 
example of how to determine a process to be executed, the process to execute is 
determined to be the n-th process from the beginning of the process queue, where n is 
obtained by "(the value of integrating counter 1/time slice) mod the value of processes 
number counter 2." Hereupon, "P/Q" means to "divide P by Q and obtain a quotient" and 
"R mod S" means to "divide R by S and obtain a remainder." The processing of the step 
102 is exclusive with the processing of the process queue manager 4. 

[0055] In the step 103, the process-executing entity judges whether switch 

processing to the next process to be executed is required. If switch processing is 
required, the process-executing entity goes to step 104. If not, the process-executing 
entity terminates the processing. 

[0056] Judgment as to whether switching to the next process is required is 

made, depending on whether the process determined in the step 102 is the same one that 
was being executed during the time slice that has just now expired. In the step 104, the 
process-executing entity switches the context to the process determined in the step 102 
and starts to execute the process. After executing the process, the process-executing 
entity terminates the processing. 

[0057] In the way described above, for a plurality of parallel programs, 

each consisting of an equal number of parallel processes, parallel processes are allocated 

13 



to each processor, and the process-executing entity 5 determines one of these processes to 
execute per time slice and executes the process. In other words, the process-executing 
entity 5 of each processor determines a sequence in which the queued processes are 
executed in time slices so that all parallel processes of one of the parallel programs are 
executed simultaneously and in turn. 

[0058] FIG. 1 1 is a flowchart illustrating the processing of the process 

queue manager 4 5 which will be explained below. 

[0059] This processing is activated by process generation or removal 

carried out by any parallel program manager 6 in the system. If a plurality of parallel 
program managers 6 perform process generation or removal of multiple programs at the 
same time, the process queue manager 4 sequentially performs the processing required 
for the multiple programs, and this processing is exclusive with other processing. Even if 
the process-executing entity 5 is determining a process to execute, the process queue 
manager operates exclusively with the process-executing entity 5. 

[0060] In step 1 1 1, the process queue manager judges whether to do 

process generation or removal. For process generation, the process queue manager goes 
to step 1 12. For process removal, the process queue manager goes to step 1 14. 

[0061] In the step 1 12, the process queue manager enqueues the newly 

allocated process of a parallel program to be activated into the process queue 3 so that the 
processes in the queue will be sequenced by the identifiers assigned to them, then the 
processing goes to step 113. 

[0062] In the step 1 13, the process queue manager increments the process 

number counter 2 by one and terminates the processing. In the step 1 14, the process 
queue manager removes the allocated process of a parallel program to be deactivated 



from the process queue 3, then the processing goes to step 115. 

[0063] In the step 1 15, the process queue manager decrements the process 

number counter 2 by one and terminates the processing. 

[0064] FIG. 12 is a flowchart illustrating the processing of the parallel 

program manager 6, which will be explained below. 

[0065] The parallel program manager 6 is activated when a user of the 

system issues a request for generating a parallel program or on the completion of all 
parallel processes of one parallel program. If requests for generating a plurality of 
parallel programs are issued to the same parallel program manager 6, the multiplex 
processing of these requests will take place concurrently in the parallel program manger 
of the processor. 

[0066] In step 121, the parallel program manger judges whether to 

generate or remove a parallel program. For generation, the parallel program manager 
goes to step 122. For removal, the parallel program manager goes to step 124. 

[0067] In the step 122, the parallel program manager determines an 

identifier that is unique within the system for the parallel program to be generated and 
assigns the identifier to the parallel processes of the parallel program. Then, the parallel 
program manager goes to step 123. 

[0068] As an example, the following identifier scheme is used. The 

parallel program manager 6 provided on each processor Pi (where i = 1 to N) assigns a 
number staring with 0 up to 9999 to a parallel program. This number assignment shall 
not be duplicated for the parallel programs being run, generated by the same parallel 
program manager 6. Each parallel program manager 6 assigns the above number plus 
"10000 X i" to the parallel processes of the parallel program as their identifier. 



[0069] In the step 123, in accordance with the request for parallel program 

generation, the parallel program manager generates and allocates the parallel processes of 
the parallel program to the processors to execute them, then terminates the processing. 

[0070] In the step 124, the parallel program manager requests all 

processors that have completed the allocated process of the parallel program to remove 
the parallel process of the parallel program from the local queue, then terminates the 
processing. In consequence, the parallel processes of the parallel program executed on 
all processors will be removed. 

[0071] FIGS. 4 and 5 are diagrams giving examples of the transition of 

the processes to be executed on each processor over time when the parallel processes of a 
plurality of parallel programs are executed in a time-sharing manner in the first 
embodiment of the invention. In these examples, the parallel programs are run on two 
processors PI and P2. It is assumed that the time slice period is 100 ms and the time 
register is used as the integrating counter. 

[0072] In the example of FIG. 4, two parallel programs 71 and 72 run, one 

consisting of two processes Al and A2 and the other consisting of two processes Bl and 
B2. The allocated processes are placed in the process queue 3 on each processor as 
shown in FIG. 7: processes Al and Bl are placed in the process queue 3 of processor PI ; 
and processes A2 and B2 are placed in the process queue 3 of processor P2. 

[0073] The processes Al and A2 are assigned identifier x and the 

processes Bl and B2 identifier y, placed in the process queues 3 of the processors. 
Because the processes in the process queues 3 are sequenced by identifiers x and y 
assigned to them, the parallel processes of one of the parallel programs are in the 
corresponding position of order in the process queues 3 of the processors PI and P2. In 



this example, the processes are arranged, according to the sequence of identifiers x and y. 

[0074] Returning to FIG. 4, at time 41 , the process to be executed is 

determined by "(0/100) mod 2 = 0." Therefore, the 0th processes from the beginning of 
the process queues, namely, processes Al and A2 of the parallel program 71 are executed 
on the processors PI and P2, respectively. 

[0075] Similarly, the processes to be executed are determined and 

executed on the processors as follows. At time 42, processes Bl and B2 of the parallel 
program 72 are executed. At time 43, processes Al and A2 of the parallel program 71 
are executed again. At time 44, processes Bl and B2 of the parallel program 72 are 
executed again. 

[0076] FIG. 5 is a diagram giving an example of the transition of the 

processes to be executed on each processor over time, wherein a new parallel program is 
generated and allocated when the parallel processes of two programs are executed in a 
time-sharing manner, according to the parallel processes run scheduling method of the 
present invention, all programs consisting of the equal number of parallel processes. The 
parallel processes of a parallel program are not always generated or removed 
simultaneously on all processors assigned to execute them. In the example of FIG. 5, the 
number of processes being run on the processor PI is three, whereas, that number on the 
processor P2 is two. 

[0077] Because it may temporarily happen that the number of running 

parallel processes of the parallel program is different between the processors, the 
sequence of processes to be run changes and simultaneous running of the parallel 
processes of the same parallel program becomes impossible momentarily. However, the 
simultaneous running of the parallel processes of the same parallel program eventually 
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recovers soon without initiating communication between the processors, as will be 
described in the following example. 

[0078] Assuming that three parallel programs 71, 72, and 73 are running, 

the first one consisting of two processes Al and A2, the second one consisting of two 
processes Bl and B2, and the third one consisting of two processes CI and C2, the 
process queues of the processors are as shown in FIG. 8 by way of example. In the 
process queue 3 of processor PI, processes Al, Bl , and CI are placed. In the process 
queue 3 of processor 2, processes A2, B2, and C2 are placed. The processes Al and A2 
are assigned identifier x, the processes Bl and B2 identifier y, and the processes CI and 
C2 identifier z, placed in the process queues 3 of the processors. Because the processes in 
the process queues 3 are sequenced by the identifiers x, y, and z assigned to them, the 
parallel processes of one of the parallel programs are in the corresponding position of 
order in the process queues 3 of the processors PI and P2. In this example, the processes 
are arranged, according to the sequence of identifiers x, y, and z. 

[0079] In the example of FIG. 5, at time 5 1 , three processes Al , B 1 , and 

CI are placed in the process queue of processor PI, as shown in FIG. 8, whereas two 
processes A2 and B2 are placed in the process queue of processor P2, as shown in FIG. 7. 
At time 51, the process to be executed is determined by "(500/100) mod 3 = 2" on the 
processor PI, and, therefore, process CI of the parallel program 53 is executed. 

[0080] On the processor P2, the process to be executed is determined by 

"(500/100) mod 2 = 1" and process B2 of the parallel program 72 is executed. 

[008 1 ] At time 52, the number of running processes becomes three for 

both processors PI and P2 and the process queues of the processors are set in the states 
shown in FIG. 8. 
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[0082] At time 52, the process to be executed is determined by "(600/100) 

mod 3 = 0", and, therefore, processes Al and A2 of the parallel program 71 are executed 
on the processors PI and P2, respectively. 

[0083] On the processors PI and P2, similarly, processes Bl and B2 of 

the parallel program 72 are executed at time 53 and processes CI and C2 of the parallel 
program 73 are executed at time 54, respectively. 

[0084] Even if such temporary conditions in which the number of running 

processes is different between the processors are caused by the generation or removal of 
parallel processes of a parallel program, the simultaneous running of the parallel 
processes of the same parallel program can be scheduled soon without initiating 
communication between the processors in the way described above. 

[0085] In second embodiment of the present invention, it is assumed that 

parallel programs run, each consisting of a discrete number of parallel processes. In this 
case, when the parallel program manager 6 generates and allocates parallel processes of a 
parallel program, additionally, it generates dummy processes. The dummy processes are 
allocated to processors that are not to assigned execute any parallel process of the parallel 
program and executed simultaneously with the parallel processes of the parallel program. 

[0086] With reference to FIGS. 2, 6, 9, and 13, the second embodiment of 

the invention will be explained below. 

[0087] The process queue manager 4 and the process-executing entity 5 

perform the same processing as in the first embodiment. Specifically, as is the case in 
the first embodiment, for parallel programs, each consisting of a discrete number of 
parallel processes, the sequence in which their processes are executed is determined by 
the process-executing entity 5 of each processor, so that all parallel processes of one of 



the parallel programs will be executed simultaneously and in turn. 

[0088] FIG. 13 is a flowchart illustrating the processing of the parallel 

program manager 6 in the second embodiment of the invention, which will be explained 
below. 

[0089] In step 1 3 1 , the parallel program manager judges whether to 

generate or remove a parallel program. For generation, the parallel program manager 
goes to step 132. For removal, the parallel program manager goes to step 134. 

[0090] In the step 132, the parallel program manager assigns a program 

identifier that is unique within the system to the parallel processes of the parallel 
program, then goes to step 133. 

[009 1 ] In the step 1 33, if the number of the processes of a parallel 

program to be generated is less than the number of the processors to run the parallel 
program in a time-sharing-based multiplex manner, the parallel program manager 
generates as many dummy processes as required to fill up the difference in number in 
addition to generating the parallel processes of the parallel program in accordance with 
the request for parallel program generation and allocates the generated processes 
including the dummy processes to the processors, then terminates the processing. 
Otherwise, the parallel program manager generates and allocates the parallel processes of 
the parallel program to the processors and terminates the processing. 

[0092] The dummy processes may be those that do nothing but create a 

spin loop. The number of processes to constitute a parallel program is to be specified 
when the parallel program is complied or activated and the number of dummy processes 
to be generated can be determined, depending on the thus specified number of processes. 
The dummy processes are assigned the same identifier as assigned to the parallel 
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processes of the parallel program to be executed simultaneously. 

[0093] In the step 134, the parallel program manager requests all 

processors that have completed the allocated process or dummy process of the parallel 
program to remove the parallel process or dummy process of the parallel program from 
the local queue, then terminates the processing. As a consequence, the parallel processes 
of the parallel program and the dummy processes executed simultaneously with the 
parallel processes will be removed. 

[0094] FIG. 6 is a diagram giving an example of the transition of the 

processes to be executed on each processor over time, wherein a dummy process is 
executed when a plurality of parallel programs, each consisting of the discrete number of 
processes, are executed in a time-sharing-based multiplex manner in the second 
embodiment of the invention. 

[0095] In this example, two parallel programs 71 and 74 run, one program 

71 consisting of two processes Al and A 2 and the other program 74 consisting of one 
process Dl . The processor PI is shared for running the parallel programs 71 and 74. 
The process queues of the processors are as shown in FIG. 9. In the process queue of the 
processor PI , processes Al and Dl are placed. In the process queue of the processor P2, 
process A2 and a dummy process are placed. 

[0096] The processes Al and A2 are assigned identifier x, and the process 

Dl and the dummy process are assigned identifier w, placed in the process queues 3 of 
the processors. Because the processes in the process queues 3 are sequenced by 
identifiers x and w assigned to them, the parallel processes of one of the parallel 
programs are in the corresponding position of order in the process queues of the 
processors PI and P2. In this example, the processes are arranged, according to the 





sequence of identifiers x and w. 



[0097] 



Returning to FIG. 6, at time 61, the process to be executed is 



determined by "(0/100) mod 2 = 0", and, therefore, processes Al and A2 of the parallel 
program 71 are executed on the processors PI and P2, respectively. At time 62, the 
process to be executed is determined by "(100/100) mod 2=1", and, therefore, process 
Dl of the parallel program 74 is executed on the processor 1. At time 63, the process to 
be executed is determined by "(200/100) mod 2 = 0", and, therefore, processes Al and 
A2 of the parallel program 71 are executed again on the processors PI and P2, 
respectively. Moreover, on the processor P2, the dummy process is executed at time 62. 

[0098] During the time-sharing execution of a plurality of parallel 

programs, each consisting of a discrete number of processes, when a processor is not 
executing a parallel process of a parallel program, it executes a dummy process 
simultaneously with the parallel processes of the parallel program in the way described 
above. In this way, the simultaneous run of the parallel processes of one of the parallel 
programs can be scheduled in turn. 

[0099] In accordance with the present invention, as explained above, it is 

feasible to schedule the run of parallel processes without requiring explicit 
processor-to-processor communication, thus reducing the overhead for coordination of 
the processors due to scheduling by coordinating the process-run steps of the processors. 

[0100] The present invention may be embodied in other specific forms 

without departing from its spirit or essential characteristics. The described embodiments 
are to be considered in all respects only as illustrative and not restrictive. The scope of 
the invention is, therefore, indicated by the appended claims rather than by the foregoing 
description. All changes which come within the meaning and range of equivalency of the 
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claims are to be embraced within their scope. 
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